首页> 外文OA文献 >Exploiting temporal information for DCNN-based fine-grained object classification
【2h】

Exploiting temporal information for DCNN-based fine-grained object classification

机译:利用时间信息进行基于DCNN的细粒度对象分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Fine-grained classification is a relatively new field that has concentrated on using information from a single image, while ignoring the enormous potential of using video data to improve classification. In this work we present the novel task of video-based fine-grained object classification, propose a corresponding new video dataset, and perform a systematic study of several recent deep convolutional neural network (DCNN) based approaches, which we specifically adapt to the task. We evaluate three-dimensional DCNNs, two-stream DCNNs, and bilinear DCNNs. Two forms of the two-stream approach are used, where spatial and temporal data from two independent DCNNs are fused either via early fusion (combination of the fully-connected layers) and late fusion (concatenation of the softmax outputs of the DCNNs). For bilinear DCNNs, information from the convolutional layers of the spatial and temporal DCNNs is combined via local co-occurrences. We then fuse the bilinear DCNN and early fusion of the two-stream approach to combine the spatial and temporal information at the local and global level (Spatio-Temporal Co-occurrence). Using the new and challenging video dataset of birds, classification performance is improved from 23.1% (using single images) to 41.1% when using the Spatio-Temporal Co-occurrence system. Incorporating automatically detected bounding box location further improves the classification accuracy to 53.6%.
机译:细粒度分类是一个相对较新的领域,专注于使用来自单个图像的信息,而忽略了使用视频数据来改善分类的巨大潜力。在这项工作中,我们提出了基于视频的细粒度目标分类的新任务,提出了一个相应的新视频数据集,并对几种最近基于深度卷积神经网络(DCNN)的方法进行了系统研究,我们专门适应了该任务。我们评估三维DCNN,两流DCNN和双线性DCNN。使用两种形式的双流方法,其中通过早期融合(完全连接的层的组合)和后期融合(DCNN的softmax输出的串联)融合来自两个独立DCNN的空间和时间数据。对于双线性DCNN,来自空间和时间DCNN的卷积层的信息通过局部共现进行组合。然后,我们融合双线性DCNN和两流方法的早期融合,以在局部和全局级别(时空共现)组合时空信息。使用具有挑战性的新的鸟类视频数据集,使用时空共现系统时,分类性能从23.1%(使用单个图像)提高到41.1%。合并自动检测到的边界框位置可进一步将分类准确性提高到53.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号